Dataset statistics
| Number of variables | 25 |
|---|---|
| Number of observations | 64404 |
| Missing cells | 716890 |
| Missing cells (%) | 44.5% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 50.8 MiB |
| Average record size in memory | 826.8 B |
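The missing-cell percentage can be cross-checked against the other figures in this table. A minimal sketch, assuming the percentage is the number of missing cells divided by rows × columns; the function name is illustrative and not part of pandas-profiling:

```python
def missing_cell_pct(missing_cells: int, n_rows: int, n_cols: int) -> float:
    """Share of missing cells as a percentage of all cells (rows x columns)."""
    total_cells = n_rows * n_cols
    return 100.0 * missing_cells / total_cells


# Figures copied from the Dataset statistics table.
pct = missing_cell_pct(missing_cells=716890, n_rows=64404, n_cols=25)
print(f"{pct:.1f}%")  # prints 44.5%
```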
Variable types
| Categorical | 12 |
|---|---|
| Numeric | 12 |
| Unsupported | 1 |
MarcaVehiculo__c has constant value "97.0" | Constant |
MdeloVehiculo__c has constant value "999.0" | Constant |
n_prod_prev is highly correlated with total_siniestros and 1 other fields | High correlation |
total_siniestros is highly correlated with n_prod_prev and 2 other fields | High correlation |
total_pagado_smmlv is highly correlated with n_prod_prev and 2 other fields | High correlation |
anios_ultimo_siniestro is highly correlated with total_siniestros and 1 other fields | High correlation |
Activos__c is highly correlated with AnnualRevenue and 1 other fields | High correlation |
AnnualRevenue is highly correlated with Activos__c and 1 other fields | High correlation |
EgresosAnuales__c is highly correlated with Activos__c and 1 other fields | High correlation |
total_siniestros is highly correlated with total_pagado_smmlv | High correlation |
total_pagado_smmlv is highly correlated with total_siniestros | High correlation |
anios_ultimo_siniestro is highly correlated with AnnualRevenue and 1 other fields | High correlation |
AnnualRevenue is highly correlated with anios_ultimo_siniestro and 1 other fields | High correlation |
EgresosAnuales__c is highly correlated with anios_ultimo_siniestro and 1 other fields | High correlation |
total_siniestros is highly correlated with total_pagado_smmlv and 1 other fields | High correlation |
total_pagado_smmlv is highly correlated with total_siniestros and 1 other fields | High correlation |
anios_ultimo_siniestro is highly correlated with total_siniestros and 1 other fields | High correlation |
AnnualRevenue is highly correlated with EgresosAnuales__c | High correlation |
EgresosAnuales__c is highly correlated with AnnualRevenue | High correlation |
tipo_prod_desc is highly correlated with MarcaVehiculo__c and 2 other fields | High correlation |
n_prod_prev is highly correlated with MarcaVehiculo__c and 3 other fields | High correlation |
CodigoTipoAsegurado__c is highly correlated with MarcaVehiculo__c and 1 other fields | High correlation |
MarcaVehiculo__c is highly correlated with tipo_prod_desc and 10 other fields | High correlation |
Genero__pc is highly correlated with MarcaVehiculo__c and 3 other fields | High correlation |
FechaInicioVigencia__ctrim is highly correlated with MarcaVehiculo__c and 1 other fields | High correlation |
MdeloVehiculo__c is highly correlated with tipo_prod_desc and 10 other fields | High correlation |
TipoVehiculo__c is highly correlated with n_prod_prev and 5 other fields | High correlation |
ciudad_name is highly correlated with MarcaVehiculo__c and 1 other fields | High correlation |
tipo_poliza_name is highly correlated with tipo_prod_desc and 3 other fields | High correlation |
EstadoCivil__pc is highly correlated with MarcaVehiculo__c and 3 other fields | High correlation |
churn is highly correlated with n_prod_prev and 2 other fields | High correlation |
CodigoTipoAsegurado__c is highly correlated with n_prod_prev and 1 other fields | High correlation |
PuntoVenta__c is highly correlated with ClaseVehiculo__c and 1 other fields | High correlation |
tipo_poliza_name is highly correlated with tipo_prod_desc and 9 other fields | High correlation |
tipo_prod_desc is highly correlated with tipo_poliza_name and 2 other fields | High correlation |
ClaseVehiculo__c is highly correlated with PuntoVenta__c and 9 other fields | High correlation |
TipoVehiculo__c is highly correlated with PuntoVenta__c and 9 other fields | High correlation |
NumeroPoliza__c is highly correlated with tipo_poliza_name and 6 other fields | High correlation |
FechaInicioVigencia__ctrim is highly correlated with tipo_poliza_name and 3 other fields | High correlation |
churn is highly correlated with ClaseVehiculo__c and 3 other fields | High correlation |
n_prod_prev is highly correlated with CodigoTipoAsegurado__c and 10 other fields | High correlation |
total_siniestros is highly correlated with tipo_poliza_name and 3 other fields | High correlation |
total_pagado_smmlv is highly correlated with CodigoTipoAsegurado__c and 7 other fields | High correlation |
anios_ultimo_siniestro is highly correlated with Activos__c and 2 other fields | High correlation |
Activos__c is highly correlated with n_prod_prev and 4 other fields | High correlation |
AnnualRevenue is highly correlated with n_prod_prev and 3 other fields | High correlation |
MontoAnual__c is highly correlated with tipo_poliza_name and 4 other fields | High correlation |
OtrosIngresos__c is highly correlated with Activos__c and 1 other fields | High correlation |
EgresosAnuales__c is highly correlated with n_prod_prev and 4 other fields | High correlation |
EstadoCivil__pc is highly correlated with ClaseVehiculo__c and 3 other fields | High correlation |
Genero__pc is highly correlated with tipo_poliza_name and 2 other fields | High correlation |
edad is highly correlated with ClaseVehiculo__c and 1 other fields | High correlation |
MarcaVehiculo__c has 12899 (20.0%) missing values | Missing |
MdeloVehiculo__c has 12899 (20.0%) missing values | Missing |
n_prod_prev has 61750 (95.9%) missing values | Missing |
total_siniestros has 60246 (93.5%) missing values | Missing |
total_pagado_smmlv has 60246 (93.5%) missing values | Missing |
anios_ultimo_siniestro has 60246 (93.5%) missing values | Missing |
Activos__c has 58200 (90.4%) missing values | Missing |
AnnualRevenue has 58200 (90.4%) missing values | Missing |
MontoAnual__c has 64395 (> 99.9%) missing values | Missing |
OtrosIngresos__c has 59709 (92.7%) missing values | Missing |
Profesion__pc has 64404 (100.0%) missing values | Missing |
EgresosAnuales__c has 58200 (90.4%) missing values | Missing |
EstadoCivil__pc has 9823 (15.3%) missing values | Missing |
Genero__pc has 9823 (15.3%) missing values | Missing |
ciudad_name has 9823 (15.3%) missing values | Missing |
edad has 56027 (87.0%) missing values | Missing |
OtrosIngresos__c is highly skewed (γ₁ = 41.12936967) | Skewed |
Profesion__pc is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
OtrosIngresos__c has 4402 (6.8%) zeros | Zeros |
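The "Missing" alerts above can be turned into a shortlist of columns that are almost entirely empty. This is a rough sketch only: the 90% cutoff and the helper name `columns_above` are assumptions chosen for illustration, not something the report prescribes.

```python
N_ROWS = 64404  # "Number of observations" from the Dataset statistics table

# Missing counts copied from the "Missing" alerts.
missing_counts = {
    "MarcaVehiculo__c": 12899,
    "MdeloVehiculo__c": 12899,
    "n_prod_prev": 61750,
    "total_siniestros": 60246,
    "total_pagado_smmlv": 60246,
    "anios_ultimo_siniestro": 60246,
    "Activos__c": 58200,
    "AnnualRevenue": 58200,
    "MontoAnual__c": 64395,
    "OtrosIngresos__c": 59709,
    "Profesion__pc": 64404,
    "EgresosAnuales__c": 58200,
    "EstadoCivil__pc": 9823,
    "Genero__pc": 9823,
    "ciudad_name": 9823,
    "edad": 56027,
}


def columns_above(counts, n_rows, threshold):
    """Columns whose missing fraction exceeds `threshold`, most-missing first."""
    fractions = {name: count / n_rows for name, count in counts.items()}
    flagged = [name for name, frac in fractions.items() if frac > threshold]
    return sorted(flagged, key=lambda name: fractions[name], reverse=True)


# 0.90 is an assumed cutoff; pick whatever suits the downstream model.
mostly_missing = columns_above(missing_counts, N_ROWS, threshold=0.90)
print(mostly_missing)
```

With this cutoff, `edad` (87.0% missing) falls just below the threshold, so borderline columns may deserve a manual look.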
Reproduction
| Analysis started | 2022-05-10 03:34:33.186994 |
|---|---|
| Analysis finished | 2022-05-10 03:41:12.268564 |
| Duration | 6 minutes and 39.08 seconds |
| Software version | pandas-profiling v3.2.0 |
| Download configuration | config.json |
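The reported duration follows directly from the two timestamps. A small sketch using only the standard library, assuming both timestamps are in the same time zone:

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"

# Timestamps copied from the Reproduction table.
started = datetime.strptime("2022-05-10 03:34:33.186994", FMT)
finished = datetime.strptime("2022-05-10 03:41:12.268564", FMT)

elapsed = finished - started
minutes, seconds = divmod(elapsed.total_seconds(), 60)
print(f"{int(minutes)} minutes and {seconds:.2f} seconds")
```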
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.6 MiB |
| 1 | |
|---|---|
| 4 | 1256 |
| 2 | 838 |
| 3 | 759 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 64404 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 4 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 61551 | |
| 4 | 1256 | 2.0% |
| 2 | 838 | 1.3% |
| 3 | 759 | 1.2% |
[Plots omitted: Length, Category Frequency Plot]
| Value | Count | Frequency (%) |
| 1 | 61551 | |
| 4 | 1256 | 2.0% |
| 2 | 838 | 1.3% |
| 3 | 759 | 1.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 61551 | |
| 4 | 1256 | 2.0% |
| 2 | 838 | 1.3% |
| 3 | 759 | 1.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 64404 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 61551 | |
| 4 | 1256 | 2.0% |
| 2 | 838 | 1.3% |
| 3 | 759 | 1.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 64404 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 61551 | |
| 4 | 1256 | 2.0% |
| 2 | 838 | 1.3% |
| 3 | 759 | 1.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 64404 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 61551 | |
| 4 | 1256 | 2.0% |
| 2 | 838 | 1.3% |
| 3 | 759 | 1.2% |
| Distinct | 1394 |
|---|---|
| Distinct (%) | 2.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7529.805261 |
| Minimum | 1 |
|---|---|
| Maximum | 99999 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 503.3 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 301 |
| Q1 | 1778 |
| median | 9672 |
| Q3 | 12285 |
| 95-th percentile | 12845 |
| Maximum | 99999 |
| Range | 99998 |
| Interquartile range (IQR) | 10507 |
Descriptive statistics
| Standard deviation | 5009.310331 |
|---|---|
| Coefficient of variation (CV) | 0.6652642609 |
| Kurtosis | 0.1363023156 |
| Mean | 7529.805261 |
| Median Absolute Deviation (MAD) | 2946 |
| Skewness | -0.2235605019 |
| Sum | 484949578 |
| Variance | 25093190 |
| Monotonicity | Not monotonic |
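Several of the descriptive statistics above are simple functions of one another, which makes them easy to sanity-check. A minimal sketch, assuming the usual definitions (CV = standard deviation / mean, variance = standard deviation squared, IQR = Q3 − Q1):

```python
# Values copied from the Quantile statistics and Descriptive statistics tables.
mean, std = 7529.805261, 5009.310331
q1, q3 = 1778, 12285
minimum, maximum = 1, 99999

cv = std / mean                  # coefficient of variation
variance = std ** 2              # variance is the squared standard deviation
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum  # range

print(round(cv, 4), round(variance), iqr, value_range)
```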
| Value | Count | Frequency (%) |
| 3301 | 1863 | 2.9% |
| 12190 | 1647 | 2.6% |
| 7002 | 1362 | 2.1% |
| 1149 | 1065 | 1.7% |
| 19 | 979 | 1.5% |
| 9721 | 977 | 1.5% |
| 610 | 963 | 1.5% |
| 12254 | 836 | 1.3% |
| 1503 | 740 | 1.1% |
| 103 | 736 | 1.1% |
| Other values (1384) | 53236 |
| Value | Count | Frequency (%) |
| 1 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
| 5 | 188 | |
| 7 | 1 | < 0.1% |
| 8 | 11 | < 0.1% |
| 9 | 5 | < 0.1% |
| 11 | 1 | < 0.1% |
| 13 | 1 | < 0.1% |
| 14 | 38 | 0.1% |
| 15 | 8 | < 0.1% |
| Value | Count | Frequency (%) |
| 99999 | 1 | < 0.1% |
| 20001 | 11 | |
| 13093 | 4 | < 0.1% |
| 13088 | 8 | |
| 13083 | 1 | < 0.1% |
| 13080 | 3 | < 0.1% |
| 13076 | 3 | < 0.1% |
| 13074 | 9 | |
| 13072 | 17 | |
| 13071 | 3 | < 0.1% |
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.1 MiB |
| s.o.a.t | |
|---|---|
| individual | 4596 |
| responsabilidad civil | 2602 |
| otras | 1564 |
| de daños tradicional | 1136 |
| Other values (9) | 3001 |
Length
| Max length | 45 |
|---|---|
| Median length | 7 |
| Mean length | 8.543397926 |
| Min length | 5 |
Characters and Unicode
| Total characters | 550229 |
|---|---|
| Distinct characters | 22 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | de daños tradicional |
|---|---|
| 2nd row | de daños |
| 3rd row | flotante |
| 4th row | responsabilidad civil |
| 5th row | colectiva |
Common Values
| Value | Count | Frequency (%) |
| s.o.a.t | 51505 | |
| individual | 4596 | 7.1% |
| responsabilidad civil | 2602 | 4.0% |
| otras | 1564 | 2.4% |
| de daños tradicional | 1136 | 1.8% |
| de deudores hipotecarios | 743 | 1.2% |
| de daños | 470 | 0.7% |
| flotante | 456 | 0.7% |
| todo riesgo de obras civiles daños materiales | 416 | 0.6% |
| global sector privado | 412 | 0.6% |
| Other values (4) | 504 | 0.8% |
[Plot omitted: Length]
| Value | Count | Frequency (%) |
| s.o.a.t | 51505 | |
| individual | 4596 | 6.1% |
| de | 2856 | 3.8% |
| responsabilidad | 2602 | 3.5% |
| civil | 2602 | 3.5% |
| daños | 2022 | 2.7% |
| otras | 1564 | 2.1% |
| tradicional | 1136 | 1.5% |
| deudores | 743 | 1.0% |
| hipotecarios | 743 | 1.0% |
| Other values (18) | 4568 | 6.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 154515 | |
| a | 71273 | |
| o | 64942 | |
| s | 64154 | |
| t | 57463 | 10.4% |
| i | 30394 | 5.5% |
| d | 22882 | 4.2% |
| l | 13576 | 2.5% |
| e | 10717 | 1.9% |
| (space) | 10533 | 1.9% |
| Other values (12) | 49780 | 9.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 385181 | |
| Other Punctuation | 154515 | |
| Space Separator | 10533 | 1.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 71273 | |
| o | 64942 | |
| s | 64154 | |
| t | 57463 | |
| i | 30394 | |
| d | 22882 | 5.9% |
| l | 13576 | 3.5% |
| e | 10717 | 2.8% |
| n | 9203 | 2.4% |
| r | 9182 | 2.4% |
| Other values (10) | 31395 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 154515 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 10533 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 385181 | |
| Common | 165048 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 71273 | |
| o | 64942 | |
| s | 64154 | |
| t | 57463 | |
| i | 30394 | |
| d | 22882 | 5.9% |
| l | 13576 | 3.5% |
| e | 10717 | 2.8% |
| n | 9203 | 2.4% |
| r | 9182 | 2.4% |
| Other values (10) | 31395 |
Common
| Value | Count | Frequency (%) |
| . | 154515 | |
| (space) | 10533 | 6.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 548207 | |
| None | 2022 | 0.4% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 154515 | |
| a | 71273 | |
| o | 64942 | |
| s | 64154 | |
| t | 57463 | 10.5% |
| i | 30394 | 5.5% |
| d | 22882 | 4.2% |
| l | 13576 | 2.5% |
| e | 10717 | 2.0% |
| (space) | 10533 | 1.9% |
| Other values (11) | 47758 | 8.7% |
None
| Value | Count | Frequency (%) |
| ñ | 2022 |
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.8 MiB |
| otras | |
|---|---|
| convenios | 1230 |
| au excepciones | 482 |
| au ded unic liv | 470 |
| disp legales | 24 |
Length
| Max length | 15 |
|---|---|
| Median length | 5 |
| Mean length | 5.219334203 |
| Min length | 5 |
Characters and Unicode
| Total characters | 336146 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | otras |
|---|---|
| 2nd row | otras |
| 3rd row | otras |
| 4th row | otras |
| 5th row | otras |
Common Values
| Value | Count | Frequency (%) |
| otras | 62198 | |
| convenios | 1230 | 1.9% |
| au excepciones | 482 | 0.7% |
| au ded unic liv | 470 | 0.7% |
| disp legales | 24 | < 0.1% |
[Plots omitted: Length, Category Frequency Plot]
| Value | Count | Frequency (%) |
| otras | 62198 | |
| convenios | 1230 | 1.9% |
| au | 952 | 1.4% |
| excepciones | 482 | 0.7% |
| ded | 470 | 0.7% |
| unic | 470 | 0.7% |
| liv | 470 | 0.7% |
| disp | 24 | < 0.1% |
| legales | 24 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 65140 | |
| s | 63958 | |
| a | 63174 | |
| r | 62198 | |
| t | 62198 | |
| n | 3412 | 1.0% |
| e | 3194 | 1.0% |
| i | 2676 | 0.8% |
| c | 2664 | 0.8% |
| (space) | 1916 | 0.6% |
| Other values (7) | 5616 | 1.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 334230 | |
| Space Separator | 1916 | 0.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 65140 | |
| s | 63958 | |
| a | 63174 | |
| r | 62198 | |
| t | 62198 | |
| n | 3412 | 1.0% |
| e | 3194 | 1.0% |
| i | 2676 | 0.8% |
| c | 2664 | 0.8% |
| v | 1700 | 0.5% |
| Other values (6) | 3916 | 1.2% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1916 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 334230 | |
| Common | 1916 | 0.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 65140 | |
| s | 63958 | |
| a | 63174 | |
| r | 62198 | |
| t | 62198 | |
| n | 3412 | 1.0% |
| e | 3194 | 1.0% |
| i | 2676 | 0.8% |
| c | 2664 | 0.8% |
| v | 1700 | 0.5% |
| Other values (6) | 3916 | 1.2% |
Common
| Value | Count | Frequency (%) |
| (space) | 1916 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 336146 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 65140 | |
| s | 63958 | |
| a | 63174 | |
| r | 62198 | |
| t | 62198 | |
| n | 3412 | 1.0% |
| e | 3194 | 1.0% |
| i | 2676 | 0.8% |
| c | 2664 | 0.8% |
| (space) | 1916 | 0.6% |
| Other values (7) | 5616 | 1.7% |
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 20029.1162 |
| Minimum | 1 |
|---|---|
| Maximum | 99999 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 503.3 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 5 |
| 95-th percentile | 99999 |
| Maximum | 99999 |
| Range | 99998 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 40020.56009 |
|---|---|
| Coefficient of variation (CV) | 1.998119123 |
| Kurtosis | 0.2434989554 |
| Mean | 20029.1162 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.497828892 |
| Sum | 1289955200 |
| Variance | 1601645230 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 46371 | |
| 99999 | 12899 | 20.0% |
| 5 | 2819 | 4.4% |
| 2 | 1275 | 2.0% |
| 3 | 474 | 0.7% |
| 7 | 209 | 0.3% |
| 6 | 191 | 0.3% |
| 4 | 84 | 0.1% |
| 9 | 60 | 0.1% |
| 8 | 22 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 46371 | |
| 2 | 1275 | 2.0% |
| 3 | 474 | 0.7% |
| 4 | 84 | 0.1% |
| 5 | 2819 | 4.4% |
| 6 | 191 | 0.3% |
| 7 | 209 | 0.3% |
| 8 | 22 | < 0.1% |
| 9 | 60 | 0.1% |
| 99999 | 12899 | 20.0% |
| Value | Count | Frequency (%) |
| 99999 | 12899 | 20.0% |
| 9 | 60 | 0.1% |
| 8 | 22 | < 0.1% |
| 7 | 209 | 0.3% |
| 6 | 191 | 0.3% |
| 5 | 2819 | 4.4% |
| 4 | 84 | 0.1% |
| 3 | 474 | 0.7% |
| 2 | 1275 | 2.0% |
| 1 | 46371 |
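The value 99999 accounts for 12899 rows (20.0%), the same count as the missing values reported for MarcaVehiculo__c and MdeloVehiculo__c, and it dominates the mean (20029.1162) even though the median is 1. The sketch below recomputes the mean from the frequency table with and without that value. Treating 99999 as a placeholder code is an assumption; the report itself does not say so.

```python
# Value -> count, copied from the frequency table above.
value_counts = {
    1: 46371, 99999: 12899, 5: 2819, 2: 1275, 3: 474,
    7: 209, 6: 191, 4: 84, 9: 60, 8: 22,
}

SENTINEL = 99999  # assumption: a placeholder code rather than a real measurement


def weighted_mean(counts):
    """Mean of a variable given a {value: count} frequency table."""
    total = sum(counts.values())
    return sum(value * n for value, n in counts.items()) / total


mean_all = weighted_mean(value_counts)
mean_without_sentinel = weighted_mean(
    {value: n for value, n in value_counts.items() if value != SENTINEL}
)
print(round(mean_all, 4), round(mean_without_sentinel, 4))
```

If the assumption holds, recoding 99999 as missing before modelling would keep it from inflating the mean, standard deviation and correlation estimates.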
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 12899 |
| Missing (%) | 20.0% |
| Memory size | 3.4 MiB |
| 97.0 |
|---|
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 206020 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 97.0 |
|---|---|
| 2nd row | 97.0 |
| 3rd row | 97.0 |
| 4th row | 97.0 |
| 5th row | 97.0 |
Common Values
| Value | Count | Frequency (%) |
| 97.0 | 51505 | |
| (Missing) | 12899 | 20.0% |
[Plots omitted: Length, Category Frequency Plot]
| Value | Count | Frequency (%) |
| 97.0 | 51505 |
Most occurring characters
| Value | Count | Frequency (%) |
| 9 | 51505 | |
| 7 | 51505 | |
| . | 51505 | |
| 0 | 51505 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 154515 | |
| Other Punctuation | 51505 | 25.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 51505 | |
| 7 | 51505 | |
| 0 | 51505 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 51505 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 206020 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 9 | 51505 | |
| 7 | 51505 | |
| . | 51505 | |
| 0 | 51505 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 206020 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 9 | 51505 | |
| 7 | 51505 | |
| . | 51505 | |
| 0 | 51505 |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 12899 |
| Missing (%) | 20.0% |
| Memory size | 3.5 MiB |
| 999.0 |
|---|
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Characters and Unicode
| Total characters | 257525 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 999.0 |
|---|---|
| 2nd row | 999.0 |
| 3rd row | 999.0 |
| 4th row | 999.0 |
| 5th row | 999.0 |
Common Values
| Value | Count | Frequency (%) |
| 999.0 | 51505 | |
| (Missing) | 12899 | 20.0% |
[Plots omitted: Length, Category Frequency Plot]
| Value | Count | Frequency (%) |
| 999.0 | 51505 |
Most occurring characters
| Value | Count | Frequency (%) |
| 9 | 154515 | |
| . | 51505 | 20.0% |
| 0 | 51505 | 20.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 206020 | |
| Other Punctuation | 51505 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 154515 | |
| 0 | 51505 | 25.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 51505 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 257525 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 9 | 154515 | |
| . | 51505 | 20.0% |
| 0 | 51505 | 20.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 257525 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 9 | 154515 | |
| . | 51505 | 20.0% |
| 0 | 51505 | 20.0% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.6 MiB |
| 0 | |
|---|---|
| 99999 |
Length
| Max length | 5 |
|---|---|
| Median length | 1 |
| Mean length | 1.801130365 |
| Min length | 1 |
Characters and Unicode
| Total characters | 116000 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 99999 |
|---|---|
| 2nd row | 99999 |
| 3rd row | 99999 |
| 4th row | 99999 |
| 5th row | 99999 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 51505 | |
| 99999 | 12899 | 20.0% |
[Plots omitted: Length, Category Frequency Plot]
| Value | Count | Frequency (%) |
| 0 | 51505 | |
| 99999 | 12899 | 20.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 9 | 64495 | |
| 0 | 51505 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 116000 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 64495 | |
| 0 | 51505 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 116000 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 9 | 64495 | |
| 0 | 51505 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 116000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 9 | 64495 | |
| 0 | 51505 |
| Distinct | 60441 |
|---|---|
| Distinct (%) | 93.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3954909.092 |
| Minimum | 1000002 |
|---|---|
| Maximum | 4845222 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 503.3 KiB |
Quantile statistics
| Minimum | 1000002 |
|---|---|
| 5-th percentile | 1003854.15 |
| Q1 | 4139160.75 |
| median | 4587055.5 |
| Q3 | 4618324.25 |
| 95-th percentile | 4631331.85 |
| Maximum | 4845222 |
| Range | 3845220 |
| Interquartile range (IQR) | 479163.5 |
Descriptive statistics
| Standard deviation | 1154304.525 |
|---|---|
| Coefficient of variation (CV) | 0.2918662601 |
| Kurtosis | 2.046573422 |
| Mean | 3954909.092 |
| Median Absolute Deviation (MAD) | 44192 |
| Skewness | -1.875809211 |
| Sum | 2.547119651 × 10¹¹ |
| Variance | 1.332418937 × 10¹² |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1004261 | 11 | < 0.1% |
| 1001176 | 11 | < 0.1% |
| 1001182 | 11 | < 0.1% |
| 1004259 | 11 | < 0.1% |
| 1001214 | 10 | < 0.1% |
| 1001179 | 10 | < 0.1% |
| 1004276 | 10 | < 0.1% |
| 1004239 | 10 | < 0.1% |
| 1000489 | 10 | < 0.1% |
| 1001257 | 9 | < 0.1% |
| Other values (60431) | 64301 |
| Value | Count | Frequency (%) |
| 1000002 | 5 | |
| 1000004 | 5 | |
| 1000006 | 3 | |
| 1000007 | 1 | < 0.1% |
| 1000009 | 5 | |
| 1000010 | 4 | |
| 1000013 | 1 | < 0.1% |
| 1000014 | 3 | |
| 1000015 | 3 | |
| 1000016 | 4 |
| Value | Count | Frequency (%) |
| 4845222 | 1 | |
| 4663419 | 1 | |
| 4661342 | 1 | |
| 4649462 | 1 | |
| 4649406 | 1 | |
| 4645719 | 1 | |
| 4645718 | 1 | |
| 4640593 | 1 | |
| 4634789 | 1 | |
| 4634788 | 1 |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.9 MiB |
| 02-2021 | |
|---|---|
| 01-2021 | 3195 |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Characters and Unicode
| Total characters | 450828 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 02-2021 |
|---|---|
| 2nd row | 02-2021 |
| 3rd row | 02-2021 |
| 4th row | 01-2021 |
| 5th row | 02-2021 |
Common Values
| Value | Count | Frequency (%) |
| 02-2021 | 61209 | |
| 01-2021 | 3195 | 5.0% |
[Plots omitted: Length, Category Frequency Plot]
| Value | Count | Frequency (%) |
| 02-2021 | 61209 | |
| 01-2021 | 3195 | 5.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 190017 | |
| 0 | 128808 | |
| 1 | 67599 | 15.0% |
| - | 64404 | 14.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 386424 | |
| Dash Punctuation | 64404 | 14.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 190017 | |
| 0 | 128808 | |
| 1 | 67599 | 17.5% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 64404 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 450828 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 190017 | |
| 0 | 128808 | |
| 1 | 67599 | 15.0% |
| - | 64404 | 14.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 450828 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 190017 | |
| 0 | 128808 | |
| 1 | 67599 | 15.0% |
| - | 64404 | 14.3% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.6 MiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 64404 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 54055 | |
| 1 | 10349 | 16.1% |
[Plots omitted: Length, Category Frequency Plot]
| Value | Count | Frequency (%) |
| 0 | 54055 | |
| 1 | 10349 | 16.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 54055 | |
| 1 | 10349 | 16.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 64404 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 54055 | |
| 1 | 10349 | 16.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 64404 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 54055 | |
| 1 | 10349 | 16.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 64404 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 54055 | |
| 1 | 10349 | 16.1% |
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 61750 |
| Missing (%) | 95.9% |
| Memory size | 2.5 MiB |
| 1.0 | |
|---|---|
| 3.0 | |
| 8.0 | |
| 2.0 | 80 |
| 4.0 | 8 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 7962 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 8.0 |
|---|---|
| 2nd row | 8.0 |
| 3rd row | 8.0 |
| 4th row | 1.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 1071 | 1.7% |
| 3.0 | 1021 | 1.6% |
| 8.0 | 474 | 0.7% |
| 2.0 | 80 | 0.1% |
| 4.0 | 8 | < 0.1% |
| (Missing) | 61750 |
[Plots omitted: Length, Category Frequency Plot]
| Value | Count | Frequency (%) |
| 1.0 | 1071 | |
| 3.0 | 1021 | |
| 8.0 | 474 | |
| 2.0 | 80 | 3.0% |
| 4.0 | 8 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 2654 | |
| 0 | 2654 | |
| 1 | 1071 | |
| 3 | 1021 | 12.8% |
| 8 | 474 | 6.0% |
| 2 | 80 | 1.0% |
| 4 | 8 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5308 | |
| Other Punctuation | 2654 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2654 | |
| 1 | 1071 | |
| 3 | 1021 | 19.2% |
| 8 | 474 | 8.9% |
| 2 | 80 | 1.5% |
| 4 | 8 | 0.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2654 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 7962 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 2654 | |
| 0 | 2654 | |
| 1 | 1071 | |
| 3 | 1021 | 12.8% |
| 8 | 474 | 6.0% |
| 2 | 80 | 1.0% |
| 4 | 8 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7962 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 2654 | |
| 0 | 2654 | |
| 1 | 1071 | |
| 3 | 1021 | 12.8% |
| 8 | 474 | 6.0% |
| 2 | 80 | 1.0% |
| 4 | 8 | 0.1% |
total_siniestros
Real number (ℝ≥0)
Alerts: HIGH CORRELATION (×4), MISSING
| Distinct | 32 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 60246 |
| Missing (%) | 93.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 48.32924483 |
| Minimum | 1 |
|---|---|
| Maximum | 940 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 503.3 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 20 |
| Q3 | 38 |
| 95-th percentile | 150 |
| Maximum | 940 |
| Range | 939 |
| Interquartile range (IQR) | 35 |
Descriptive statistics
| Standard deviation | 63.0592206 |
|---|---|
| Coefficient of variation (CV) | 1.304783901 |
| Kurtosis | 27.59590949 |
| Mean | 48.32924483 |
| Median Absolute Deviation (MAD) | 18 |
| Skewness | 2.855234945 |
| Sum | 200953 |
| Variance | 3976.465303 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 150 | 974 | 1.5% |
| 1 | 786 | 1.2% |
| 37 | 548 | 0.9% |
| 38 | 475 | 0.7% |
| 16 | 247 | 0.4% |
| 3 | 238 | 0.4% |
| 2 | 218 | 0.3% |
| 4 | 142 | 0.2% |
| 8 | 96 | 0.1% |
| 5 | 65 | 0.1% |
| Other values (22) | 369 | 0.6% |
| (Missing) | 60246 |
| Value | Count | Frequency (%) |
| 1 | 786 | |
| 2 | 218 | 0.3% |
| 3 | 238 | 0.4% |
| 4 | 142 | 0.2% |
| 5 | 65 | 0.1% |
| 6 | 33 | 0.1% |
| 7 | 52 | 0.1% |
| 8 | 96 | 0.1% |
| 9 | 27 | < 0.1% |
| 10 | 31 | < 0.1% |
| Value | Count | Frequency (%) |
| 940 | 3 | < 0.1% |
| 150 | 974 | |
| 90 | 2 | < 0.1% |
| 80 | 4 | < 0.1% |
| 52 | 7 | < 0.1% |
| 51 | 1 | < 0.1% |
| 45 | 4 | < 0.1% |
| 38 | 475 | |
| 37 | 548 | |
| 36 | 2 | < 0.1% |
total_pagado_smmlv
Real number (ℝ≥0)
Alerts: HIGH CORRELATION (×4), MISSING
| Distinct | 386 |
|---|---|
| Distinct (%) | 9.3% |
| Missing | 60246 |
| Missing (%) | 93.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2670.067165 |
| Minimum | 0 |
|---|---|
| Maximum | 55871.95629 |
| Zeros | 627 |
| Zeros (%) | 1.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 503.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 17.35001589 |
| median | 370.2541766 |
| Q3 | 3065.269972 |
| 95-th percentile | 8833.286309 |
| Maximum | 55871.95629 |
| Range | 55871.95629 |
| Interquartile range (IQR) | 3047.919956 |
Descriptive statistics
| Standard deviation | 3785.493099 |
|---|---|
| Coefficient of variation (CV) | 1.417752013 |
| Kurtosis | 18.08380843 |
| Mean | 2670.067165 |
| Median Absolute Deviation (MAD) | 370.2541766 |
| Skewness | 2.27192553 |
| Sum | 11102139.27 |
| Variance | 14329958 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 8833.286309 | 974 | 1.5% |
| 0 | 627 | 1.0% |
| 1306.925184 | 527 | 0.8% |
| 3065.269972 | 474 | 0.7% |
| 57.0596277 | 218 | 0.3% |
| 17.35001589 | 126 | 0.2% |
| 38.70535986 | 81 | 0.1% |
| 207.8908871 | 53 | 0.1% |
| 87.77731669 | 48 | 0.1% |
| 50.85823109 | 45 | 0.1% |
| Other values (376) | 985 | 1.5% |
| (Missing) | 60246 | 93.5% |
| Value | Count | Frequency (%) |
| 0 | 627 | 1.0% |
| 0.1679049361 | 1 | < 0.1% |
| 0.225420076 | 2 | < 0.1% |
| 0.2439357817 | 1 | < 0.1% |
| 0.2557769398 | 1 | < 0.1% |
| 0.2844277434 | 1 | < 0.1% |
| 0.2931979932 | 1 | < 0.1% |
| 0.3073748027 | 1 | < 0.1% |
| 0.3077402298 | 1 | < 0.1% |
| 0.333331132 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 55871.95629 | 2 | < 0.1% |
| 22272.9517 | 3 | < 0.1% |
| 8833.286309 | 974 | 1.5% |
| 4385.698773 | 2 | < 0.1% |
| 3065.269972 | 474 | 0.7% |
| 2345.751903 | 3 | < 0.1% |
| 2265.861102 | 1 | < 0.1% |
| 1556.751448 | 1 | < 0.1% |
| 1446.628935 | 1 | < 0.1% |
| 1306.925184 | 527 | 0.8% |
anios_ultimo_siniestro
Real number (ℝ≥0)
Alerts: HIGH CORRELATION (×4), MISSING
| Distinct | 221 |
|---|---|
| Distinct (%) | 5.3% |
| Missing | 60246 |
| Missing (%) | 93.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.2722416599 |
| Minimum | 0.002739726027 |
|---|---|
| Maximum | 9.465753425 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 503.3 KiB |
Quantile statistics
| Minimum | 0.002739726027 |
|---|---|
| 5-th percentile | 0.002739726027 |
| Q1 | 0.005479452055 |
| median | 0.01095890411 |
| Q3 | 0.08493150685 |
| 95-th percentile | 1.606986301 |
| Maximum | 9.465753425 |
| Range | 9.463013699 |
| Interquartile range (IQR) | 0.07945205479 |
Descriptive statistics
| Standard deviation | 0.9555272766 |
|---|---|
| Coefficient of variation (CV) | 3.509849583 |
| Kurtosis | 34.33231817 |
| Mean | 0.2722416599 |
| Median Absolute Deviation (MAD) | 0.008219178082 |
| Skewness | 5.496456599 |
| Sum | 1131.980822 |
| Variance | 0.9130323764 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.002739726027 | 990 | 1.5% |
| 0.008219178082 | 559 | 0.9% |
| 0.01095890411 | 557 | 0.9% |
| 0.005479452055 | 395 | 0.6% |
| 0.09315068493 | 137 | 0.2% |
| 0.1095890411 | 101 | 0.2% |
| 0.07123287671 | 70 | 0.1% |
| 0.01643835616 | 56 | 0.1% |
| 0.07671232877 | 55 | 0.1% |
| 0.03287671233 | 43 | 0.1% |
| Other values (211) | 1195 | 1.9% |
| (Missing) | 60246 | 93.5% |
| Value | Count | Frequency (%) |
| 0.002739726027 | 990 | 1.5% |
| 0.005479452055 | 395 | 0.6% |
| 0.008219178082 | 559 | 0.9% |
| 0.01095890411 | 557 | 0.9% |
| 0.01369863014 | 35 | 0.1% |
| 0.01643835616 | 56 | 0.1% |
| 0.02191780822 | 13 | < 0.1% |
| 0.02465753425 | 32 | < 0.1% |
| 0.02739726027 | 6 | < 0.1% |
| 0.0301369863 | 15 | < 0.1% |
| Value | Count | Frequency (%) |
| 9.465753425 | 1 | < 0.1% |
| 8.75890411 | 6 | < 0.1% |
| 8 | 5 | < 0.1% |
| 7.531506849 | 2 | < 0.1% |
| 7.178082192 | 6 | < 0.1% |
| 7.101369863 | 3 | < 0.1% |
| 7.005479452 | 3 | < 0.1% |
| 6.994520548 | 1 | < 0.1% |
| 6.235616438 | 10 | < 0.1% |
| 6.008219178 | 7 | < 0.1% |
Activos__c
| Distinct | 1655 |
|---|---|
| Distinct (%) | 26.7% |
| Missing | 58200 |
| Missing (%) | 90.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 513144079.7 |
| Minimum | 0 |
|---|---|
| Maximum | 8.31 × 10¹⁰ |
| Zeros | 50 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 503.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 3000000 |
| Q1 | 50000000 |
| median | 123219000 |
| Q3 | 331112500 |
| 95-th percentile | 1558311000 |
| Maximum | 8.31 × 10¹⁰ |
| Range | 8.31 × 10¹⁰ |
| Interquartile range (IQR) | 281112500 |
Descriptive statistics
| Standard deviation | 2312679344 |
|---|---|
| Coefficient of variation (CV) | 4.506881079 |
| Kurtosis | 395.0702999 |
| Mean | 513144079.7 |
| Median Absolute Deviation (MAD) | 98219000 |
| Skewness | 16.36979187 |
| Sum | 3.183545871 × 10¹² |
| Variance | 5.348485747 × 10¹⁸ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 100000000 | 205 | 0.3% |
| 200000000 | 179 | 0.3% |
| 50000000 | 174 | 0.3% |
| 80000000 | 163 | 0.3% |
| 150000000 | 158 | 0.2% |
| 120000000 | 127 | 0.2% |
| 60000000 | 125 | 0.2% |
| 300000000 | 116 | 0.2% |
| 10000000 | 109 | 0.2% |
| 30000000 | 102 | 0.2% |
| Other values (1645) | 4746 | 7.4% |
| (Missing) | 58200 | 90.4% |
| Value | Count | Frequency (%) |
| 0 | 50 | 0.1% |
| 1 | 90 | 0.1% |
| 2 | 38 | 0.1% |
| 20 | 3 | < 0.1% |
| 10000 | 1 | < 0.1% |
| 107254 | 1 | < 0.1% |
| 127000 | 1 | < 0.1% |
| 200000 | 2 | < 0.1% |
| 230000 | 1 | < 0.1% |
| 300000 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 8.31 × 10¹⁰ | 1 | < 0.1% |
| 5.643561885 × 10¹⁰ | 1 | < 0.1% |
| 3.3401125 × 10¹⁰ | 3 | < 0.1% |
| 3.2934155 × 10¹⁰ | 7 | < 0.1% |
| 2.8207083 × 10¹⁰ | 1 | < 0.1% |
| 2.379921321 × 10¹⁰ | 1 | < 0.1% |
| 1.9483034 × 10¹⁰ | 3 | < 0.1% |
| 1.9263183 × 10¹⁰ | 3 | < 0.1% |
| 1.7267744 × 10¹⁰ | 7 | < 0.1% |
| 1.56067031 × 10¹⁰ | 5 | < 0.1% |
AnnualRevenue
Real number (ℝ≥0)
Alerts: HIGH CORRELATION (×4), MISSING
| Distinct | 1830 |
|---|---|
| Distinct (%) | 29.5% |
| Missing | 58200 |
| Missing (%) | 90.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 304876044.3 |
| Minimum | 0 |
|---|---|
| Maximum | 7.2539149 × 10¹⁰ |
| Zeros | 67 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 503.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 5500000 |
| Q1 | 24000000 |
| median | 43000000 |
| Q3 | 96000000 |
| 95-th percentile | 769247174.6 |
| Maximum | 7.2539149 × 10¹⁰ |
| Range | 7.2539149 × 10¹⁰ |
| Interquartile range (IQR) | 72000000 |
Descriptive statistics
| Standard deviation | 2063906208 |
|---|---|
| Coefficient of variation (CV) | 6.76965687 |
| Kurtosis | 393.1213775 |
| Mean | 304876044.3 |
| Median Absolute Deviation (MAD) | 26673126 |
| Skewness | 17.16512571 |
| Sum | 1.891450979 × 10¹² |
| Variance | 4.259708835 × 10¹⁸ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 36000000 | 252 | 0.4% |
| 24000000 | 209 | 0.3% |
| 60000000 | 182 | 0.3% |
| 30000000 | 177 | 0.3% |
| 48000000 | 165 | 0.3% |
| 12000000 | 135 | 0.2% |
| 40000000 | 104 | 0.2% |
| 18000000 | 100 | 0.2% |
| 50000000 | 93 | 0.1% |
| 42000000 | 75 | 0.1% |
| Other values (1820) | 4712 | 7.3% |
| (Missing) | 58200 | 90.4% |
| Value | Count | Frequency (%) |
| 0 | 67 | 0.1% |
| 1 | 13 | < 0.1% |
| 52230 | 1 | < 0.1% |
| 90790 | 1 | < 0.1% |
| 240000 | 1 | < 0.1% |
| 500000 | 1 | < 0.1% |
| 600000 | 1 | < 0.1% |
| 835000 | 1 | < 0.1% |
| 877000 | 1 | < 0.1% |
| 900000 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 7.2539149 × 10¹⁰ | 1 | < 0.1% |
| 4.24173603 × 10¹⁰ | 2 | < 0.1% |
| 3.408 × 10¹⁰ | 1 | < 0.1% |
| 3.243391822 × 10¹⁰ | 1 | < 0.1% |
| 2.9458395 × 10¹⁰ | 1 | < 0.1% |
| 2.9375273 × 10¹⁰ | 7 | < 0.1% |
| 2.765797 × 10¹⁰ | 3 | < 0.1% |
| 2.010036 × 10¹⁰ | 1 | < 0.1% |
| 1.897052621 × 10¹⁰ | 5 | < 0.1% |
| 1.7620986 × 10¹⁰ | 1 | < 0.1% |
MontoAnual__c
| Distinct | 6 |
|---|---|
| Distinct (%) | 66.7% |
| Missing | 64395 |
| Missing (%) | > 99.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2579.111111 |
| Minimum | 0 |
|---|---|
| Maximum | 20000 |
| Zeros | 4 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 503.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 10 |
| Q3 | 102 |
| 95-th percentile | 13200 |
| Maximum | 20000 |
| Range | 20000 |
| Interquartile range (IQR) | 102 |
Descriptive statistics
| Standard deviation | 6606.381166 |
|---|---|
| Coefficient of variation (CV) | 2.561495369 |
| Kurtosis | 8.4219912 |
| Mean | 2579.111111 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | 2.882336495 |
| Sum | 23212 |
| Variance | 43644272.11 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 4 | < 0.1% |
| 3000 | 1 | < 0.1% |
| 100 | 1 | < 0.1% |
| 102 | 1 | < 0.1% |
| 10 | 1 | < 0.1% |
| 20000 | 1 | < 0.1% |
| (Missing) | 64395 | > 99.9% |
| Value | Count | Frequency (%) |
| 0 | 4 | < 0.1% |
| 10 | 1 | < 0.1% |
| 100 | 1 | < 0.1% |
| 102 | 1 | < 0.1% |
| 3000 | 1 | < 0.1% |
| 20000 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 20000 | 1 | < 0.1% |
| 3000 | 1 | < 0.1% |
| 102 | 1 | < 0.1% |
| 100 | 1 | < 0.1% |
| 10 | 1 | < 0.1% |
| 0 | 4 | < 0.1% |
OtrosIngresos__c
| Distinct | 142 |
|---|---|
| Distinct (%) | 3.0% |
| Missing | 59709 |
| Missing (%) | 92.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4471255.486 |
| Minimum | 0 |
|---|---|
| Maximum | 3479487967 |
| Zeros | 4402 |
| Zeros (%) | 6.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 503.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 4200000 |
| Maximum | 3479487967 |
| Range | 3479487967 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 61763776.62 |
|---|---|
| Coefficient of variation (CV) | 13.8135199 |
| Kurtosis | 2171.466619 |
| Mean | 4471255.486 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 41.12936967 |
| Sum | 2.099254451 × 10¹⁰ |
| Variance | 3.814764103 × 10¹⁵ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 4402 | 6.8% |
| 15000000 | 11 | < 0.1% |
| 12000000 | 9 | < 0.1% |
| 10000000 | 9 | < 0.1% |
| 5000000 | 9 | < 0.1% |
| 20000000 | 8 | < 0.1% |
| 6000000 | 8 | < 0.1% |
| 165644000 | 7 | < 0.1% |
| 2000000 | 7 | < 0.1% |
| 36000000 | 6 | < 0.1% |
| Other values (132) | 219 | 0.3% |
| (Missing) | 59709 | 92.7% |
| Value | Count | Frequency (%) |
| 0 | 4402 | 6.8% |
| 1 | 1 | < 0.1% |
| 2 | 2 | < 0.1% |
| 4000 | 4 | < 0.1% |
| 9000 | 1 | < 0.1% |
| 37000 | 1 | < 0.1% |
| 227000 | 2 | < 0.1% |
| 274000 | 1 | < 0.1% |
| 500000 | 1 | < 0.1% |
| 529000 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 3479487967 | 1 | < 0.1% |
| 893893000 | 2 | < 0.1% |
| 720625000 | 3 | < 0.1% |
| 655490000 | 1 | < 0.1% |
| 517969000 | 2 | < 0.1% |
| 468342000 | 3 | < 0.1% |
| 196218120 | 1 | < 0.1% |
| 194874000 | 1 | < 0.1% |
| 194601000 | 4 | < 0.1% |
| 187969000 | 3 | < 0.1% |
EgresosAnuales__c
Real number (ℝ≥0)
Alerts: HIGH CORRELATION (×4), MISSING
| Distinct | 1374 |
|---|---|
| Distinct (%) | 22.1% |
| Missing | 58200 |
| Missing (%) | 90.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 245358666.2 |
| Minimum | 0 |
|---|---|
| Maximum | 7.1738788 × 10¹⁰ |
| Zeros | 68 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 503.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2000000 |
| Q1 | 12000000 |
| median | 25000000 |
| Q3 | 58000000 |
| 95-th percentile | 592034849.4 |
| Maximum | 7.1738788 × 10¹⁰ |
| Range | 7.1738788 × 10¹⁰ |
| Interquartile range (IQR) | 46000000 |
Descriptive statistics
| Standard deviation | 1938474429 |
|---|---|
| Coefficient of variation (CV) | 7.900574531 |
| Kurtosis | 463.440573 |
| Mean | 245358666.2 |
| Median Absolute Deviation (MAD) | 15400000 |
| Skewness | 18.7591844 |
| Sum | 1.522205165 × 10¹² |
| Variance | 3.757683113 × 10¹⁸ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 12000000 | 272 | 0.4% |
| 30000000 | 247 | 0.4% |
| 18000000 | 225 | 0.3% |
| 24000000 | 214 | 0.3% |
| 20000000 | 203 | 0.3% |
| 10000000 | 154 | 0.2% |
| 40000000 | 145 | 0.2% |
| 36000000 | 141 | 0.2% |
| 15000000 | 132 | 0.2% |
| 25000000 | 108 | 0.2% |
| Other values (1364) | 4363 | 6.8% |
| (Missing) | 58200 | 90.4% |
| Value | Count | Frequency (%) |
| 0 | 68 | 0.1% |
| 1 | 39 | 0.1% |
| 200000 | 1 | < 0.1% |
| 250000 | 1 | < 0.1% |
| 300000 | 2 | < 0.1% |
| 339561 | 1 | < 0.1% |
| 400000 | 2 | < 0.1% |
| 450000 | 14 | < 0.1% |
| 497395 | 1 | < 0.1% |
| 500000 | 12 | < 0.1% |
| Value | Count | Frequency (%) |
| 7.1738788 × 10¹⁰ | 1 | < 0.1% |
| 4.033890964 × 10¹⁰ | 2 | < 0.1% |
| 3.472297457 × 10¹⁰ | 1 | < 0.1% |
| 3.36 × 10¹⁰ | 1 | < 0.1% |
| 2.8430656 × 10¹⁰ | 7 | < 0.1% |
| 2.7557799 × 10¹⁰ | 1 | < 0.1% |
| 2.5171626 × 10¹⁰ | 3 | < 0.1% |
| 1.9746108 × 10¹⁰ | 1 | < 0.1% |
| 1.663542953 × 10¹⁰ | 1 | < 0.1% |
| 1.6135595 × 10¹⁰ | 1 | < 0.1% |
EstadoCivil__pc
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 9823 |
| Missing (%) | 15.3% |
| Memory size | 3.5 MiB |
| N A | 41989 |
|---|---|
| SOLTERO | 8224 |
| CASADO | 2709 |
| OTRO | 1476 |
| UNIDO | 120 |
| Other values (3) | 63 |
Length
| Max length | 10 |
|---|---|
| Median length | 3 |
| Mean length | 3.787691688 |
| Min length | 3 |
Characters and Unicode
| Total characters | 206736 |
|---|---|
| Distinct characters | 15 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | SOLTERO |
|---|---|
| 2nd row | OTRO |
| 3rd row | SOLTERO |
| 4th row | N A |
| 5th row | OTRO |
Common Values
| Value | Count | Frequency (%) |
| N A | 41989 | 65.2% |
| SOLTERO | 8224 | 12.8% |
| CASADO | 2709 | 4.2% |
| OTRO | 1476 | 2.3% |
| UNIDO | 120 | 0.2% |
| VIUDO | 27 | < 0.1% |
| SEPARADO | 26 | < 0.1% |
| DIVORCIADO | 10 | < 0.1% |
| (Missing) | 9823 | 15.3% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| n | 41989 | 43.5% |
| a | 41989 | 43.5% |
| soltero | 8224 | 8.5% |
| casado | 2709 | 2.8% |
| otro | 1476 | 1.5% |
| unido | 120 | 0.1% |
| viudo | 27 | < 0.1% |
| separado | 26 | < 0.1% |
| divorciado | 10 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 47469 | 23.0% |
| N | 42109 | 20.4% |
| (space) | 41989 | 20.3% |
| O | 22302 | 10.8% |
| S | 10959 | 5.3% |
| R | 9736 | 4.7% |
| T | 9700 | 4.7% |
| E | 8250 | 4.0% |
| L | 8224 | 4.0% |
| D | 2902 | 1.4% |
| Other values (5) | 3096 | 1.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 164747 | 79.7% |
| Space Separator | 41989 | 20.3% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 47469 | 28.8% |
| N | 42109 | 25.6% |
| O | 22302 | 13.5% |
| S | 10959 | 6.7% |
| R | 9736 | 5.9% |
| T | 9700 | 5.9% |
| E | 8250 | 5.0% |
| L | 8224 | 5.0% |
| D | 2902 | 1.8% |
| C | 2719 | 1.7% |
| Other values (4) | 377 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 41989 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 164747 | 79.7% |
| Common | 41989 | 20.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 47469 | 28.8% |
| N | 42109 | 25.6% |
| O | 22302 | 13.5% |
| S | 10959 | 6.7% |
| R | 9736 | 5.9% |
| T | 9700 | 5.9% |
| E | 8250 | 5.0% |
| L | 8224 | 5.0% |
| D | 2902 | 1.8% |
| C | 2719 | 1.7% |
| Other values (4) | 377 | 0.2% |
Common
| Value | Count | Frequency (%) |
| (space) | 41989 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 206736 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 47469 | 23.0% |
| N | 42109 | 20.4% |
| (space) | 41989 | 20.3% |
| O | 22302 | 10.8% |
| S | 10959 | 5.3% |
| R | 9736 | 4.7% |
| T | 9700 | 4.7% |
| E | 8250 | 4.0% |
| L | 8224 | 4.0% |
| D | 2902 | 1.4% |
| Other values (5) | 3096 | 1.5% |
Genero__pc
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 9823 |
| Missing (%) | 15.3% |
| Memory size | 3.5 MiB |
| N A | 40486 |
|---|---|
| MASCULINO | 11399 |
| FEMENINO | 2696 |
Length
| Max length | 9 |
|---|---|
| Median length | 3 |
| Mean length | 4.500045803 |
| Min length | 3 |
Characters and Unicode
| Total characters | 245617 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | MASCULINO |
|---|---|
| 2nd row | MASCULINO |
| 3rd row | FEMENINO |
| 4th row | N A |
| 5th row | MASCULINO |
Common Values
| Value | Count | Frequency (%) |
| N A | 40486 | 62.9% |
| MASCULINO | 11399 | 17.7% |
| FEMENINO | 2696 | 4.2% |
| (Missing) | 9823 | 15.3% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| n | 40486 | 42.6% |
| a | 40486 | 42.6% |
| masculino | 11399 | 12.0% |
| femenino | 2696 | 2.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 57277 | 23.3% |
| A | 51885 | 21.1% |
| (space) | 40486 | 16.5% |
| M | 14095 | 5.7% |
| I | 14095 | 5.7% |
| O | 14095 | 5.7% |
| S | 11399 | 4.6% |
| C | 11399 | 4.6% |
| U | 11399 | 4.6% |
| L | 11399 | 4.6% |
| Other values (2) | 8088 | 3.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 205131 | 83.5% |
| Space Separator | 40486 | 16.5% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 57277 | 27.9% |
| A | 51885 | 25.3% |
| M | 14095 | 6.9% |
| I | 14095 | 6.9% |
| O | 14095 | 6.9% |
| S | 11399 | 5.6% |
| C | 11399 | 5.6% |
| U | 11399 | 5.6% |
| L | 11399 | 5.6% |
| E | 5392 | 2.6% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 40486 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 205131 | 83.5% |
| Common | 40486 | 16.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 57277 | 27.9% |
| A | 51885 | 25.3% |
| M | 14095 | 6.9% |
| I | 14095 | 6.9% |
| O | 14095 | 6.9% |
| S | 11399 | 5.6% |
| C | 11399 | 5.6% |
| U | 11399 | 5.6% |
| L | 11399 | 5.6% |
| E | 5392 | 2.6% |
Common
| Value | Count | Frequency (%) |
| (space) | 40486 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 245617 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 57277 | 23.3% |
| A | 51885 | 21.1% |
| (space) | 40486 | 16.5% |
| M | 14095 | 5.7% |
| I | 14095 | 5.7% |
| O | 14095 | 5.7% |
| S | 11399 | 4.6% |
| C | 11399 | 4.6% |
| U | 11399 | 4.6% |
| L | 11399 | 4.6% |
| Other values (2) | 8088 | 3.3% |
ciudad_name
| Distinct | 23 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 9823 |
| Missing (%) | 15.3% |
| Memory size | 3.8 MiB |
| otras | 25851 |
|---|---|
| BOGOTÁ D.C. | 4422 |
| CALI | 2858 |
| MEDELLIN | 2678 |
| CÚCUTA | 2519 |
| Other values (18) | 16253 |
Length
| Max length | 13 |
|---|---|
| Median length | 5 |
| Mean length | 6.643209175 |
| Min length | 4 |
Characters and Unicode
| Total characters | 362593 |
|---|---|
| Distinct characters | 31 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | otras |
|---|---|
| 2nd row | otras |
| 3rd row | otras |
| 4th row | otras |
| 5th row | CALI |
Common Values
| Value | Count | Frequency (%) |
| otras | 25851 | 40.1% |
| BOGOTÁ D.C. | 4422 | 6.9% |
| CALI | 2858 | 4.4% |
| MEDELLIN | 2678 | 4.2% |
| CÚCUTA | 2519 | 3.9% |
| TULUÁ | 1326 | 2.1% |
| BARRANQUILLA | 1298 | 2.0% |
| ARMENIA | 1265 | 2.0% |
| PEREIRA | 1241 | 1.9% |
| BUCARAMANGA | 998 | 1.5% |
| Other values (13) | 10125 | 15.7% |
| (Missing) | 9823 | 15.3% |
Length
| Value | Count | Frequency (%) |
| otras | 25851 | 41.7% |
| bogotá | 4422 | 7.1% |
| d.c | 4422 | 7.1% |
| cali | 2858 | 4.6% |
| medellin | 2678 | 4.3% |
| cúcuta | 2519 | 4.1% |
| tuluá | 1326 | 2.1% |
| barranquilla | 1298 | 2.1% |
| armenia | 1265 | 2.0% |
| pereira | 1241 | 2.0% |
| Other values (18) | 14041 | 22.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 34585 | 9.5% |
| o | 25851 | 7.1% |
| r | 25851 | 7.1% |
| a | 25851 | 7.1% |
| s | 25851 | 7.1% |
| t | 25851 | 7.1% |
| C | 16937 | 4.7% |
| I | 16617 | 4.6% |
| L | 16140 | 4.5% |
| E | 14620 | 4.0% |
| Other values (21) | 134439 | 37.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 217154 | 59.9% |
| Lowercase Letter | 129255 | 35.6% |
| Other Punctuation | 8844 | 2.4% |
| Space Separator | 7340 | 2.0% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 34585 | 15.9% |
| C | 16937 | 7.8% |
| I | 16617 | 7.7% |
| L | 16140 | 7.4% |
| E | 14620 | 6.7% |
| O | 13345 | 6.1% |
| N | 12248 | 5.6% |
| T | 11865 | 5.5% |
| R | 11463 | 5.3% |
| U | 10205 | 4.7% |
| Other values (14) | 59129 | 27.2% |
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 25851 | 20.0% |
| r | 25851 | 20.0% |
| a | 25851 | 20.0% |
| s | 25851 | 20.0% |
| t | 25851 | 20.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 8844 | 100.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 7340 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 346409 | 95.5% |
| Common | 16184 | 4.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 34585 | 10.0% |
| o | 25851 | 7.5% |
| r | 25851 | 7.5% |
| a | 25851 | 7.5% |
| s | 25851 | 7.5% |
| t | 25851 | 7.5% |
| C | 16937 | 4.9% |
| I | 16617 | 4.8% |
| L | 16140 | 4.7% |
| E | 14620 | 4.2% |
| Other values (19) | 118255 | 34.1% |
Common
| Value | Count | Frequency (%) |
| . | 8844 | 54.6% |
| (space) | 7340 | 45.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 352725 | 97.3% |
| None | 9868 | 2.7% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 34585 | 9.8% |
| o | 25851 | 7.3% |
| r | 25851 | 7.3% |
| a | 25851 | 7.3% |
| s | 25851 | 7.3% |
| t | 25851 | 7.3% |
| C | 16937 | 4.8% |
| I | 16617 | 4.7% |
| L | 16140 | 4.6% |
| E | 14620 | 4.1% |
| Other values (18) | 124571 | 35.3% |
None
| Value | Count | Frequency (%) |
| Á | 6510 | 66.0% |
| Ú | 2519 | 25.5% |
| É | 839 | 8.5% |
edad
| Distinct | 5287 |
|---|---|
| Distinct (%) | 63.1% |
| Missing | 56027 |
| Missing (%) | 87.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 52.91773365 |
| Minimum | 1.506849315 |
|---|---|
| Maximum | 122.4328767 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 503.3 KiB |
Quantile statistics
| Minimum | 1.506849315 |
|---|---|
| 5-th percentile | 26.02465753 |
| Q1 | 36.08493151 |
| median | 47.44383562 |
| Q3 | 60.60821918 |
| 95-th percentile | 122.4328767 |
| Maximum | 122.4328767 |
| Range | 120.9260274 |
| Interquartile range (IQR) | 24.52328767 |
Descriptive statistics
| Standard deviation | 24.95405359 |
|---|---|
| Coefficient of variation (CV) | 0.4715631579 |
| Kurtosis | 2.231559507 |
| Mean | 52.91773365 |
| Median Absolute Deviation (MAD) | 12.11780822 |
| Skewness | 1.575489523 |
| Sum | 443291.8548 |
| Variance | 622.7047906 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 122.4328767 | 681 | 1.1% |
| 41.91232877 | 16 | < 0.1% |
| 76.21643836 | 15 | < 0.1% |
| 72.97260274 | 14 | < 0.1% |
| 42.38082192 | 13 | < 0.1% |
| 52.38630137 | 12 | < 0.1% |
| 57.63013699 | 12 | < 0.1% |
| 48.38356164 | 12 | < 0.1% |
| 42.10410959 | 11 | < 0.1% |
| 56.83013699 | 8 | < 0.1% |
| Other values (5277) | 7583 | 11.8% |
| (Missing) | 56027 | 87.0% |
| Value | Count | Frequency (%) |
| 1.506849315 | 1 | < 0.1% |
| 2.331506849 | 1 | < 0.1% |
| 5.542465753 | 1 | < 0.1% |
| 5.55890411 | 1 | < 0.1% |
| 6.284931507 | 1 | < 0.1% |
| 7.098630137 | 1 | < 0.1% |
| 7.323287671 | 1 | < 0.1% |
| 7.509589041 | 2 | < 0.1% |
| 7.591780822 | 1 | < 0.1% |
| 7.745205479 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 122.4328767 | 681 | 1.1% |
| 112.7205479 | 1 | < 0.1% |
| 102.9945205 | 1 | < 0.1% |
| 94.96986301 | 1 | < 0.1% |
| 92.14794521 | 3 | < 0.1% |
| 90.84657534 | 2 | < 0.1% |
| 90.04931507 | 1 | < 0.1% |
| 89.71780822 | 2 | < 0.1% |
| 89.53972603 | 1 | < 0.1% |
| 88.79178082 | 2 | < 0.1% |
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better than Pearson's r at capturing nonlinear monotonic relationships. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation, and +1 indicating total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
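The definition above can be sketched in a few lines of plain Python. This is an illustrative sketch only, not the code the report generator uses; the helper names `average_ranks`, `pearson_r` and `spearman_rho` are hypothetical, and tied values are assumed to receive the average of the ranks they span.

```python
import math

def average_ranks(values):
    """Return 1-based ranks, giving tied values the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def spearman_rho(x, y):
    """Pearson correlation applied to the rank variables of x and y."""
    return pearson_r(average_ranks(x), average_ranks(y))

x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]  # monotonic but nonlinear in x
print(spearman_rho(x, y), pearson_r(x, y))
```

For the cubic relationship in the example, ρ reaches 1 while r stays below 1, which is the sense in which ρ captures nonlinear monotonic relationships.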
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation, and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
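A minimal sketch of this calculation, together with the location and scale invariance mentioned above, is given below. The function name `pearson_corr` and the sample data are illustrative assumptions and do not come from the report.

```python
import math

def pearson_corr(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

xs = [1, 2, 3, 4, 5]
ys = [2, 1, 4, 3, 5]

r_original = pearson_corr(xs, ys)
# Shift and rescale each variable separately (positive scale factors).
r_transformed = pearson_corr([3 * a + 7 for a in xs], [0.5 * b - 2 for b in ys])
print(r_original, r_transformed)
```

The two printed values agree up to floating-point error because the shifts cancel in the centered sums and the positive scale factors cancel between numerator and denominator.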
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation, and +1 indicating total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
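The pair-counting recipe above corresponds to the variant usually called τ-a, sketched below. The function name `kendall_tau_a` is hypothetical, and the sketch makes no tie correction; library implementations often default to the tie-adjusted τ-b, so results can differ when the data contain ties.

```python
from itertools import combinations

def kendall_tau_a(x, y):
    """(concordant - discordant) / total number of pairs, without tie correction."""
    concordant = discordant = 0
    pairs = list(combinations(range(len(x)), 2))
    for i, j in pairs:
        # Positive product: both variables order the pair the same way.
        sign = (x[i] - x[j]) * (y[i] - y[j])
        if sign > 0:
            concordant += 1
        elif sign < 0:
            discordant += 1
    return (concordant - discordant) / len(pairs)

tau = kendall_tau_a([1, 2, 3, 4], [1, 3, 2, 4])
print(tau)
```

In the example, five of the six pairs are concordant and one is discordant, so τ = (5 - 1) / 6.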
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available.
First rows
| | CodigoTipoAsegurado__c | PuntoVenta__c | tipo_poliza_name | tipo_prod_desc | ClaseVehiculo__c | MarcaVehiculo__c | MdeloVehiculo__c | TipoVehiculo__c | NumeroPoliza__c | FechaInicioVigencia__ctrim | churn | n_prod_prev | total_siniestros | total_pagado_smmlv | anios_ultimo_siniestro | Activos__c | AnnualRevenue | MontoAnual__c | OtrosIngresos__c | Profesion__pc | EgresosAnuales__c | EstadoCivil__pc | Genero__pc | ciudad_name | edad |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | 805 | de daños tradicional | otras | 99999 | NaN | NaN | 99999 | 1001140 | 02-2021 | 1 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 1 | 1 | 805 | de daños | otras | 99999 | NaN | NaN | 99999 | 1001140 | 02-2021 | 1 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 2 | 1 | 805 | flotante | otras | 99999 | NaN | NaN | 99999 | 1001140 | 02-2021 | 1 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 3 | 1 | 1203 | responsabilidad civil | otras | 99999 | NaN | NaN | 99999 | 1009923 | 01-2021 | 1 | NaN | NaN | NaN | NaN | 5.000000e+08 | 3.000000e+08 | NaN | NaN | NaN | 2.500000e+08 | SOLTERO | MASCULINO | otras | 34.775342 |
| 4 | 4 | 404 | colectiva | otras | 99999 | NaN | NaN | 99999 | 3034490 | 02-2021 | 1 | NaN | 4.0 | 0.000000 | 3.082192 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 5 | 1 | 1002 | individual | convenios | 99999 | NaN | NaN | 99999 | 3048712 | 01-2021 | 1 | 8.0 | 38.0 | 3065.269972 | 0.008219 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 6 | 1 | 1002 | individual | convenios | 99999 | NaN | NaN | 99999 | 3052432 | 01-2021 | 1 | 8.0 | 38.0 | 3065.269972 | 0.008219 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 7 | 1 | 15 | s.o.a.t | otras | 1 | 97.0 | 999.0 | 0 | 4593295 | 01-2021 | 0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 8 | 1 | 23 | colectiva | otras | 99999 | NaN | NaN | 99999 | 3025528 | 02-2021 | 1 | NaN | NaN | NaN | NaN | 1.856000e+09 | 2.251415e+09 | NaN | 1870000.0 | NaN | 2.111152e+09 | OTRO | MASCULINO | otras | 41.156164 |
| 9 | 1 | 8042 | individual | convenios | 99999 | NaN | NaN | 99999 | 3075559 | 01-2021 | 0 | NaN | NaN | NaN | NaN | 8.000000e+07 | 4.000000e+07 | NaN | 0.0 | NaN | 1.400000e+07 | SOLTERO | FEMENINO | otras | 40.547945 |
Last rows
| | CodigoTipoAsegurado__c | PuntoVenta__c | tipo_poliza_name | tipo_prod_desc | ClaseVehiculo__c | MarcaVehiculo__c | MdeloVehiculo__c | TipoVehiculo__c | NumeroPoliza__c | FechaInicioVigencia__ctrim | churn | n_prod_prev | total_siniestros | total_pagado_smmlv | anios_ultimo_siniestro | Activos__c | AnnualRevenue | MontoAnual__c | OtrosIngresos__c | Profesion__pc | EgresosAnuales__c | EstadoCivil__pc | Genero__pc | ciudad_name | edad |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 64394 | 4 | 1803 | responsabilidad civil | otras | 99999 | NaN | NaN | 99999 | 1008336 | 01-2021 | 1 | 1.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 64395 | 1 | 1002 | normal | otras | 99999 | NaN | NaN | 99999 | 1002679 | 01-2021 | 1 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 64396 | 1 | 23 | individual | otras | 99999 | NaN | NaN | 99999 | 3025434 | 01-2021 | 0 | NaN | NaN | NaN | NaN | 3.307350e+08 | 1.916790e+08 | NaN | 0.0 | NaN | 1.716010e+08 | SOLTERO | MASCULINO | otras | 31.394521 |
| 64397 | 4 | 1402 | de daños | otras | 99999 | NaN | NaN | 99999 | 1001179 | 01-2021 | 1 | NaN | 12.0 | 25.850198 | 0.073973 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 64398 | 1 | 3301 | otras | otras | 99999 | NaN | NaN | 99999 | 3000193 | 01-2021 | 1 | 2.0 | NaN | NaN | NaN | 3.429540e+09 | 1.625870e+09 | NaN | 34993000.0 | NaN | 1.289049e+09 | CASADO | MASCULINO | otras | 55.701370 |
| 64399 | 1 | 3301 | otras | otras | 99999 | NaN | NaN | 99999 | 3000192 | 01-2021 | 1 | 2.0 | NaN | NaN | NaN | 3.429540e+09 | 1.625870e+09 | NaN | 34993000.0 | NaN | 1.289049e+09 | CASADO | MASCULINO | otras | 55.701370 |
| 64400 | 1 | 3202 | individual | convenios | 99999 | NaN | NaN | 99999 | 3129700 | 01-2021 | 0 | NaN | NaN | NaN | NaN | 1.585329e+09 | 1.150000e+08 | NaN | 3000000.0 | NaN | 6.500000e+07 | CASADO | MASCULINO | otras | 42.380822 |
| 64401 | 1 | 404 | colectiva | otras | 99999 | NaN | NaN | 99999 | 3034376 | 01-2021 | 0 | 1.0 | NaN | NaN | NaN | 2.527961e+09 | 2.285945e+09 | NaN | 0.0 | NaN | 1.998118e+09 | OTRO | MASCULINO | otras | 46.342466 |
| 64402 | 1 | 1820 | individual | otras | 99999 | NaN | NaN | 99999 | 3075907 | 01-2021 | 0 | NaN | NaN | NaN | NaN | 1.500000e+08 | 6.800000e+07 | NaN | 0.0 | NaN | 4.000000e+07 | CASADO | MASCULINO | otras | 39.698630 |
| 64403 | 1 | 3303 | s.o.a.t | otras | 1 | 97.0 | 999.0 | 0 | 4595162 | 02-2021 | 0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | N A | MASCULINO | BARRANQUILLA | 27.232877 |